Sequencing by Hybridization – A Simulation Study of Performance on Genomic Sequences
نویسندگان
چکیده
Sequencing by Hybridization (SBH)[1,2] is a theoretical method for de-novo sequencing of DNA by means of reconstruction of the sequence from its hybridization pattern. Typically, arraying a complete set of k-mers is considered, although different setups, such as degenerate probe arrays, are also possible [3,5]. While this method is not competitive with current biochemical sequencing methods, variations of the model may be practical for problems such as SNP genotyping. Formally, given a DNA sequence S of length n and a probe length k (where k<<n), hybridization of S on an array of all possible k-mers defines a hybridization pattern ) (S k σ . Given ) (S k σ , the length n, and the k-prefix and k-suffix of S, SBH algorithms attempt to reconstruct the sequence S [4]. Even when assuming perfect and stringent hybridization SBH still suffers from an inherent information problem. For any given k, there exist sequences that cannot be uniquely reconstructed by SBH. Specifically, sequences that contain a double alternating repeat of (k-1)-mers ( e d c b a β α β α ) or a triple repeat ( d c b a γ γ γ ) cannot be uniquely reconstructed. Therefore, for a given k, the probability of success of SBH in uniquely reconstructing randomly generated sequences decreases rapidly as sequence length, n, increases. Since, from the information point of view, (k-1)-mer repeats are the basis of the failure mechanism we expect performance to be even worse on genomic sequences. In this report, we describe an analysis by simulations of SBH performance on genomic sequences. We specifically test the extent to which increasing order Markov models of the data explain SBH poor performance on genomic sequences.
منابع مشابه
Development of an Alu-PCR Amplified YAC Probe Suitable for Enumeration of Chromosome 13 on Uncultured Lymphocytes and Amniocytes by Fluorescence in situ Hybridization
The main objective of the present study was to develop an efficient and reliable probe to be routinely used for detection of chromosome 13 copy numbers by interphase FISH. To achieve this, a Yeast Artificial Chromosome (YAC) containing sequences specific for human 13q12 (744D11), was cultured and the whole yeast genomic DNA was extracted. The human insert within the isolated DNA was amplified b...
متن کاملHybridization of Samand Vehicle with Minimum Mechanical Modifications
In this paper an idea for hybridization of conventional vehicles has proposed. The case study performed on one of the common vehicles on country roads i.e. Samand. This vehicle has high production volume but low fuel performance therefore hybridization of it could be attractive for its manufacture. This paper aims that the hybridization idea and its structure to need minimum mechanical modifica...
متن کاملدورگهسازی در محل؛ اصول و کاربردها : مقاله مروری
In situ hybridization (ISH) is a method that uses labeled complementary single strand DNA or RNA to localize specific DNA or RNA sequences in an intact cell or in a fixed tissue section. The main steps of ISH consist of: probe selection, tissue or sample preparation, pre-hybridization treatment, hybridization and washing, detection and control procedure. Probe selection is one of the important ...
متن کاملThree-component Distillation Columns Sequencing: Including Configurations with Divided-wall Columns
In the present work, the exergy analysis and economic study of 3 different samples of threecomponent mixtures have been investigated (ESI>1, ESI≈1, and ESI<1). The feed mixture has been tested under three different compositions (low, equal, and high contents of the intermediate component). A quantitative comparison between simple and complex configurations, considering thermally coupled, thermo...
متن کاملMolecular identification of Dicrocoelium dendriticum using 28s rDNA genomic marker and its histopathologic features in domestic animals in western Iran
Introduction: Dicrocoeliasis is a common disease of bile ducts and gallbladder of domestic and wild ruminants. This disease is caused by different species of dicrocoelium including Dicrocoelium dendriticum. The aim of this study was to identify pathological damages and molecular features associated with this parasite in ruminants. Materials and Methods: In this cross-sectional study, 180 fresh...
متن کامل